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Abstract 

For finitely generated nilpotent groups, we employ Mal’cev coordinates 
to solve several classical algorithmic problems efficiently. Computation of 
normal forms, the membership problem, the conjugacy problem, and com¬ 
putation of presentations for subgroups are solved using only logarithmic 
space and quasilinear time. Logarithmic space presentation-uniform ver¬ 
sions of these algorithms are provided. Compressed-word versions of the 
same problems, in which each input word is provided as a straight-line 
program, are solved in polynomial time. 
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1 Introduction 


Algorithmic properties of finitely generated nilpotent groups have been exten¬ 
sively studied, and many algorithmic problems in these groups are known to 
be decidable. However, in most cases neither the computational complexity of 
the existing algorithms nor the exact complexity of the problems themselves is 
known. Some of the earlier results in this area were obtained before complexity 
issues became a concern in algebra, while others were mostly focused on prac¬ 
tical aspects of computing. Lately, ideas from space complexity are exerting a 
strong influence on computations in algebra, and in particular there is an active 
research interest in logarithmic space computations. Another influence of com¬ 
puter science on modern algebraic computing concerns data representation in 
compressed form and its use in developing algorithms with lower computational 
complexity. Polynomial time, logarithmic space, and compressed-word compu¬ 
tations are now the main players in modern algorithmic group theory. We will 
elaborate their roles in §1.2-§1.4 below. 

In this paper we study the computational complexity of several fundamen¬ 
tal algorithmic problems in finitely generated nilpotent groups. These prob¬ 
lems include computing normal forms (Mal’cev coordinates) of group elements 
(hence deciding the word problem), deciding the conjugacy and membership 
problems, computing kernels of homomorphisms, and finding presentations for 
finitely generated subgroups. We prove that in a fixed group all these problems 
are computable in space logarithmic in the size of the input and, simultaneously, 
in quasilinear time. 

For the decision problems above (i.e., word, conjugacy, and membership 
problems), our algorithms also solve the ‘search’ version of the problem, mean¬ 
ing if the algorithm answers “Yes” it also provides a witness to its answer, 
in logarithmic space and quasilinear time. For the word problem, this means 
writing a trivial element as a product of conjugates of relators; for the conju¬ 
gacy problem, finding a conjugator; for the membership problem, expressing the 
given element as a product of subgroup generators. 

We also consider compressed-word versions of these problems, in which each 
input word is provided as a straight-line program (compressed word) producing 
a word over the generating set. We solve all compressed versions in time quartic 
in the input size. 

All our algorithms can be executed uniformly, meaning the nilpotent group 
may be given by an arbitrary presentation as part of the input, but in general 
this will invalidate the complexity assessment as the nilpotency class and num¬ 
ber of generators play a role in the complexity bound. However, if both the 
number of generators and the nilpotency class are bounded, the algorithms run 
in logarithmic space and polynomial time, with the degree of the polynomial 
depending on the bound. 
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1.1 Known approaches and summary of new results 

An early example of the study of algorithmic problems in nilpotent groups is the 
work of Mostowski [55] , who provided solutions to several algorithmic problems 
(word, membership, finiteness problems) and further expressed hope that the 
algorithms are practical for carrying out on a digital computer. As another 
example, while the word problem in nilpotent groups has long been known to 
be decidable, it was only established comparatively recently [T5] that it is in 
fact decidable in real (therefore, linear) time. In 1958, Mal’cev [53] investigated 
finite approximations of finitely generated nilpotent groups, which allowed him 
to prove that they have decidable membership problem. In 1965, Blackburn [5], 
using the same method, showed decidability of the conjugacy problem for the 
same class of groups. Such separability arguments, while sufficient to show 
that the corresponding decision problems are solvable, offer, by themselves, no 
reasonable estimates on the time or space complexity of the algorithms involved. 
Only very recently, certain bounds for so-called full residual finiteness growth 
of nilpotent groups were established [3]. 

We would like to point out two more established approaches to algorithmic 
questions in nilpotent groups. One is based on the fact that every torsion-free 
nilpotent group embeds into a linear group UT£j(Z) (see for example [10] or El)- 
This easily solves the word problem, and was used by Grunewald and Segal in 
1980 [S] to solve the last (at the time) standing major algorithmic problem in 
nilpotent groups, the isomorphism problem. It is worth mentioning that such 
an embedding shows that the word problem for torsion-free finitely generated 
nilpotent groups is, indeed, in LSPACE [18], but does not seem to provide 
any concrete time or space complexity estimates in the case of the conjugacy or 
membership problems. 

Another fruitful approach is due to the fact that finitely generated nilpotent 
groups admit a so-called Mal’cev (Hall-Mal’cev) basis (see, for example, [10] and 
|14jl. which allows one to carry out group operations by evaluating polynomials 
(see Lemma [2.2|1 . This approach was systematically used in [15], which gave 
solutions to a number of algorithmic problems in a class of groups that includes 
finitely generated nilpotent groups. It was also used in the above-mentioned 
work of Mostowski [28] . 

Mal’cev basis techiniques can be viewed as part of a more general picture. 
Indeed, every finitely generated nilpotent group G is polycyclic. To a poly¬ 
cyclic series one may associate a polycyclic presentation^ which in the case of 
nilpotent groups is closely connected to establishing a Mal’cev basis. Many al- 
gorithimic problems may be solved using such a presentation. This approach is 
described in detail in [33] and further studied in [121129], and in particular may 
be used to solve the membership and conjugacy problems in polycyclic groups. 
To our knowledge, no robust complexity estimates for such methods have been 
established in the case of nilpotent groups, with one exception of the recent pa¬ 
pers [251126] , which hnd polynomial bounds for the equalizer and membership 
problems. 

We follow the Mal’cev basis approach. Let G be a hxed finitely generated 
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nilpotent group. We describe algorithms to solve each of the following problems. 

(I) Given g € G, compute the (Mal’cev) normal form of g. 

(II) Given g, hi,... ,hn € G, decide whether g € {hi,..., h^) and if so express 
5 as a product of subgroup generators from a standardized set. 

(III) Fix another finitely generated nilpotent group, H. Given K = {gi,..., g„), 
a homomoprhism (j) : K ^ H specified by 4’{9i) = ^i-i h G Im(^), 
compute a generating set for ker(^) and find g G G such that (p{g) = h. 

(IV) Given K = {gi,..., g„) < G, compute a presentation for K. 

(V) Given g G G, compute a generating set for the centralizer of g. 

(VI) Given g,h G G, decide whether or not there exists u G G such that 
u~^gu = h and if so find such an element u. 

In every case, the algorithm runs in space 0{\ogL) and, simultaneously, in time 
0{L\og^ L), where L is the size of the input of the given problem. Problems 
©, and m in fact run in time 0{L\og^ L). To every subgroup one may 
associate a standardized or full-form generating set, and it is this set that is 
used in problem m to express g as a product of subgroup generators; we may 
in addition express g in terms of the given generating set, but the algorithm 
then runs in polynomial time (and not logspace). 

A Mal’cev basis consisting of m elements establishes a coordinatization of 
G, whereby each element is identified with an m-tuple of integers. The coor¬ 
dinates of a product gh are given by polynomial functions of the coordinates 
of g and h, and the coordinates of a power g^ are given by polynomial func¬ 
tions of I and the coordinates of g. Our algorithms work directly with the 
coordinate tuples and these multiplication/exponentiation polynomials. The 
key to obtaining logarithmic space bounds, and polynomial time bounds for 
compressed-word problems, is the fact that coordinates of an n-fold product 
3 i <?2 • ■ - ffn are bounded in magnitude by a polynomial function whose degree 
depends only on the nilpotency class c of G (a constant) and not, as may be 
inferred by composing the polynomials n times, on the length of the product 
fTheorem 12.31) . 

The class LSPACE of problems decidable in logarithmic space is defined 
via machines called logspace transducers, and we recall the relevant definitions 
in 111.21 Logarithmic space computations in groups have been studied primarily 
in relation to the word and normal form problems. In free groups, the word 
problem was solved in logarithmic space by |18j . Normal forms were computed 
in logarithmic space for graph groups and Coxeter groups in and the class of 
groups with logspace-computable normal forms was shown to be closed under 
several important constructions in [^. The conjugacy problem was also solved 
in logspace for free solvable groups, the Grigorchuk group, and certain wreath 
products in [34] . 

We also consider compressed-word versions of problems ©-dYD). In this 
formulation, every input word is given in the form of a straight-line program 
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(see i J1.3D and the input size L is the sum of the sizes of all input programs. A 
program of size L may encode a word of length up to 2^“^, and so efficient (i.e. 
polynomial time) algorithms must work directly on the straight-line program 
without producing the encoded word itself. We solve all of the problems ([T|- 
(IVII) . with compressed-word inputs, in time with (U), (jV)), and dYD being 

solved in time 0{L^). The approach is to first solve (P) to compute the Mal’cev 
coordinates of each input element, write each coordinate as a binary number, 
then apply the previous algorithms to the coordinate tuples. This also shows 
that a second ‘compressed’ formulation, in which every input word is given by 
its (binary) Mal’cev coordinates, can be solved in the same time complexity for 
each problem. 

The compressed version of the word problem is known to be polynomial¬ 
time decidable in several classes of groups, including free groups [20], partially 
commutative groups |19j . limit groups [22] . and nilpotent groups m-, further, 
polynomial-time decidability is preserved under many important group-theoretic 
constructions, such as graph products (see m for a summary). One motivation 
for obtaining a polynomial-time solution to the compressed word problem in a 
group G is that such a solution give rise to a polynomial-time solution to the 
(non-compressed) word problem in any finitely generated subgroup of Aut(G') 
and in semi-direct products involving G [32] . 

Less is known about the compressed conjugacy problem, though it is polyno¬ 
mial-time decidable in free groups [32] and more generally partially commutative 
groups these results being part of a polynomial-time solution to the word 
problem in the outer automorphism group of these groups. For the compressed 
membership problem, we are not aware of any previous results for interesting 
classes of groups. Even in free groups no polynomial-time algorithm is known, 
though recent results of Jez m on DFAs with compressed labels are closely re¬ 
lated. Compressed membership in abelian groups is easily solved in polynomial 
time by converting the straight-line programs to 0(n)-bit integer vectors and 
applying linear algebra techniques, and our proof for nilpotent groups uses a 
similar approach. 

1.2 Logspace 

We define logarithmic space computation via a machine called a logspace trans¬ 
ducer. Briefly, this is a deterministic Turing machine with three tapes: a read¬ 
only input tape, a read-write work tape with number of cells logarithmic in the 
size of the input tape, and a write-only output tape. We provide the details 
below. 

Let E be a finite alphabet containing a symbol e called the blank symbol. A 
tape is an infinite sequence X = {x„}„ with S E and all but finitely many 
Xn being the blank symbol. The subsequence consisting of all symbols up to 
and including the last non-blank symbol is called the content of the tape and 
the length of this sequence is the size of the tape. Intuitively, we think of it as 
a one-ended infinite array of cells, each cell holding an element of this sequence. 
To every tape X we associate a positive integer hx called the head position. 
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Let S' be a finite set, called the set of states. A configuration is a tuple C = 
(s, I, hi, W, hw, O, ho) consisting of a state s G S and three tapes I, W, O called 
the input tape, work tape, and output tape (respectively) together with the head 
positions for each tape. A transducer is a function which assigns to every possi¬ 
ble configuration C a successor configuration C = (s', hi', W, hw',0', ho') 
with the following properties: 

• C depends only on s and the symbols at hi on I and at hw on W ; 

• 1 = 1' and hi' differs from hi by at most 1; 

• W and W are identical except possibly at position hw, and hw' differs 
from hw by at most 1; 

• either ho = ho' and O = O' or ho' = ho + ^ and O' differs from O only 
in position ho- 

A run of the transducer is a finite sequence of configurations Ci,C 2 , ■ ■ ■ ,Ck 
where O^+i = O' for i = — 1, 0^ = 0^ (there is no further computation 

to perform), and the work and output tapes of Oi contain only the blank symbol 
p. The content of the input tape of Oi is called the input and the content of 
the output tape of Ck is the output. 

Let c > 0 be any integer. A transducer is called a c-logspace transducer if for 
every possible run the size of the work tape in every configuration is bounded 
by clogn where n is the size of the input tape (the base of the logarithm is 
not generally relevant, though using |I]| or |S| — 1 is natural). Provided such 
a constant c exists for a given transducer, it will be called simply a logspace 
transducer. 

Let S' be another alphabet. We say that a function / : S* —>■ (S')* is logspace 
computable if there is a logspace transducer which for every input w G T,* pro¬ 
duces f{w) on the output tape. A decision problem, which we define as a 
subset of S*, is logspace decidable if its characteristic function is logspace com¬ 
putable. The complexity class LSPACE consists of all decision problems that 
are logspace decidable. Note that in order to discuss multi-variable functions, 
we simply add a new symbol a to the alphabet and separate the input words 
by this symbol. 

Any function that is logspace computable is also computable in polynomial 
time (meaning the length of every run is bounded by a polynomial function 
of n). Indeed, in any run the sequence of configurations that follow a given 
configuration Ci are determined by Ci only. Hence no run may contain the 
same configuration twice since runs have finite length by definition. Thus, the 
length of any given run (and hence the time complexity of the machine) is 
bounded by the number of possible configurations 

|5|-n-|E|'=i°8”-clogn eO(n"+2), 

where n is the length of the input. Since the degree c -I- 2 of this polynomial 
can be expected to be quite high, it is usually advantageous to analyze the time 
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complexity of logspace transducers directly and obtain, if possible, a low-degree 
polynomial time bound. 

The type of computations that can be done in logspace are quite limited. 
For example, to store an integer M requires logM bits. Hence we can store 
and count up to n'^, but not higher. We may also store and manipulate pointers 
to different locations in the input, as each such pointer is an integer of size at 
most n. Basic arithmetic operations are also computable in logspace, and it is 
possible to compose logspace transducers (i.e., the class of logspace computable 
functions is closed under composition). 

The above is a formal description of logspace computability via transduc¬ 
ers, but in practice we simply work with informal algorithm descriptions and 
ensure that our algorithms require no more than logarithmic space. Each of our 
algorithms may be formally encoded as a logspace transducer. 

1.3 Compressed words 

Let E be a set of symbols containing a special symbol e used to denote the 
empty word. A straight-line program or compressed word A over E consists of 
an ordered finite set A, called the set of non-terminal symbols, together with 
for each A G A exactly one production rule either of the form A ^ BC where 
B,C G A and B,C < A or of the form A ^ x where x G E. The greatest 
non-terminal is called the root, elements of E are called terminal symbols, and 
the size of A is the number of non-terminal symbols and is denoted |A|. Note 
that any program A may, by encoding the non-terminal symbols as integers, be 
written down using 0(|A| • log |A|) bits. 

The output or evaluation of A is the word in E* obtained by starting with 
the root non-terminal and successively replacing every non-terminal symbol with 
the right-hand side of its production rule. It is denoted eval(A) and we similarly 
denote by eval(A) the word obtained starting with the non-terminal A G A. For 
example, the program B over {x} with production rules 

Bji y Bji—\B,^—i, Bji—i y Bji—2hdn—2^ ■ ■ ■ ^ ^ ^ 

has eval(B) = x^" and eval(i32) = x^. As this example illustrates, the 
length of eval(A) may be exponential in |A| (this program in fact achieves 
the maximum-length output). Thus to have efficient algorithms dealing with 
compressed words, one must avoid computing eval(A) and instead work directly 
with the production rules. A fundamental result of Plandowski [30] states that 
two straight-line programs may be checked for character-for-character equality 
of their outputs in time polynomial in the sum of the program lengths. 

Let us note here that for any fixed word w = X 1 X 2 • • • x^ over E, the word w" 
may be encoded with a program of size 0(log(n)). Indeed, we may first encode 
w using binary subdivision, i.e. with the scheme Wi Xj for i = I,...,m, 
Wm-yi W 1 W 2 , IFm-i -2 W 3 W 4 and so on, obtaining eval(H4) = w where k 
is some integer bounded by |w| -I- log 2 Iw]. A program similar to B above, with 
Wk in place of x and suitable modifications when n is not a power of 2, will 
encode w”. 
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A straight-line program over a group G is a straight-line program over a given 
symmetrized generating set of G. Any algorithmic problem for G which takes 
words as input may be considered in ‘compressed-word form’ where all input 
words are provided as straight-line programs. For example, the compressed 
conjugacy problem asks, given two straight-line programs A and B over G, 
whether or not eval(A) and eval(B) represent conjugate group elements. 


2 Computing Mal’cev normal forms 

To produce efhcient algorithms, our nilpotent group will need to be given by 
a particular type of presentation, known as a consistent nilpotent presentation. 
Such a presentation can be computed from an arbitrary presentation (Prop. 
1^ . During computation, we represent group elements in their Mal’cev normal 
form, which we define below. Critically, converting a general group word to 
Mal’cev form involves at most a polynomial expansion in word length (Theorem 
1^ . and may be performed in logarithmic space fTheorem l2.7l) . 

2.1 Nilpotent groups and Mal’cev coordinates 

A group G is called nilpotent if it possesses central series, i.e. a normal series 

G = Gi l> G 2 >...!> Gs \> Gs+i = 1 (1) 

such that GijGij^i < Z(G/Gi+i) for all i = l,...,s where Z denotes the 
center. Equivalently, G possesses a normal series in which [G, Gi] < G^+i for 
alH = 1,..., s. 

If G is hnitely generated, so are the abelian quotients G^/Gi+i, 1 < i < 
s. Let an,... ^airtii be a standard basis of Gi/Gi+i, i.e. a generating set in 
which Gi/Gi+i has presentation {an,..., airm ki/; J ^ Ti) in the class of abelian 
groups, where 71 C {1,... ,mi} and dj G !Z>o. Formally put e^- = 00 for j ^ %■ 
Note that 

A — luii, ai2, •. • 5 } 

is a polycyclic generating set for G, and we call A a Mal’cev basis associated to 
the central series ©• 

For convenience, we will also use a simplihed notation, in which the genera¬ 
tors Oij and exponents are renumbered by replacing each subscript ij with 
j -I- ^ m^, so the generating set A can be written as A = {oi,..., Om}- We 

t<j 

allow the expression ij to stand for 7 -b ^ rn^ in other notations as well. We 

i<j 

also denote 

T = {i\ei < cx)}. 

By the choice of the set {ai,... ,am\ every element g G G may be written 
uniquely in the form 
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where ai G h and 0 < ai < Ci whenever i G T- The m-tuple {ai,... ,am) 
is called the coordinate tuple of g and is denoted Coord(g), and the expression 
is called the (Mal’cev) normal form of g. We also denote ai = 

Coordi((7). 

To a Mal’cev basis A we associate a presentation of G as follows. For each 
1 < i < m, let ni be such that Oi G Gm \ Gm+i- If * G T, then a®* G Gm+i, 
hence a relation 



( 2 ) 


holds in G for some G Z and i > i such that G Gm+i- Let 

1 < i < j < m. Since the series o is central, relations of the form 



(3) 

(4) 



hold in G for some aijk, Pijk G Z and I > j such that ai, ..., Om G Gnj+i- 

The set of (abstract) symbols {ai,...,am} together with relators (l2])-(|4]) 
present a group G' that is isomorphic to G under the natural isomorphism 
(relator (U]) may be omitted when j G T). Indeed, any presentation on symbols 


{oi,..., am} with relators of the form Q for any choice of T and relators of the 


form ([3]) and Q for all I < i < j < m defines a nilpotent group G" with cyclic 
central series having terms (a^,..., am) for i = I,..., m. Such a presentation 
is called a consistent nilpotent presentation for G" if the order of modulo 
(oi+i,..., Om) is precisely Cj. While presentations of this form need not, in 
general, be consistent, those derived from a central series of a group G as above 
must be consistent (if in G' the order of Oi is e^ < Ci, then this fact follows from 
([2])-(01) and hence a relation for a}* would have been written in ([2])). Consistency 
of the presentation implies that G' ~ G. 

Proposition 2.1. There is an algorithm that, given a finite presentation of a 
nilpotent group G, finds a consistent nilpotent presentation of G and an explicit 
isomorphism. The presentation may be chosen to be the presentation derived 
from a Mal’cev basis associated with the lower or upper central series of G. 

Proof Prop. 3.2 of [1] proves that a given nilpotent presentation may be checked 
for consistency, so to produce a consistent nilpotent presentation it suffices to 
enumerate presentations obtained from the given presentation of G by finite 
sequences of Tietze transformations until a consistent nilpotent presentation is 
found (cf. [T] Thm. 3.3). 

Further, we may compute both the lower and upper central series from a 
presentation of G. Indeed, each term F^ = [G, Fi-i] of the lower central series 
is precisely the normal closure of the set {[ 5 , 7]}£(,7 where g and 7 run over 
generating sets of G and Fi_i, respectively, and as such a generating set may be 
computed by [1] Lem. 2.5. The upper central series is computed in Cor. 5.3 of 
the same work. From either series we may find a Mal’cev basis and then write 
down, via exhaustive search, the required relators (EJ-gl). □ 
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In Proposition 15.11 we employ techniques described in Section O to give a 
polynomial-time version of Proposition 12.11 

An essential feature of the coordinate tuples for nilpotent groups is that the 
coordinates of a product (a“^ ■ • • ) (af ^ ) may be computed as a “nice” 

(polynomial if T = 0) function of the integers ai,..., am, /3i, • ■ ■, Pm, and the 
coordinates of a power may similarly be computed as a ‘nice’ 

function of ai,, am and 1. The existence of such polynomial functions for 
torsion-free nilpotent groups is proved in m and and an explicit algorithm 
to construct them from a nilpotent presentation of G is given in m- 

If any of the factors Gi/Gi+i are finite (which must occur when G has tor¬ 
sion), the coordinate functions also involve the extraction of quotients and re¬ 
mainders modulo Ci. For each i & T, we define functions : Z ^ {0,1,..., — 

1 } and Si : Z —>■ Z by the decomposition 

t = n(t) + Si{t)ei, 

where t G h. Let Tq{n) denote the set of all functions / : Z" —>■ Z formed 
as a finite composition of the functions from the set {•,-b, ri, Si | i G 7”} using 
constants from Q, where • and -I- denote multiplication and addition in Q. 

Lemma 2.2. Let G be a nilpotent group with Mal’eev basis ai,..., am- There 
exist pi,... ,Pm G 7'q(2to) and qi,... ,qm G -b 1) satisfying the following 

properties. For every g,h G G and I G Z, writing Coord( 5 ) = ( 71 ,..., 7 ^) and 
CooTd{h) = (di,.. .,6m), 

(i) Coordi(g/i) =Pi( 7 i,.. .,^m,6i,.. .,6m), 

(a) Coordi(g') =( 3 'i( 7 i,..., 7 m,/), 

(Hi) if i G T then Pi = Vi o p[ and qi = ri o q[ for some pi G 7'Q(2m) and 
Qi € 7Q(m -b 1 ), and 

(iv) if "fi = 0 for all i < k, or 6i = d for all i <k, then for all i <k + \ 

(a) Cooidi{gh) = 'ji + 6i and Coordi(g*) = Iji if i ^ T, and 

(b) Coordi{gh) = ni'ji + 6i) and Coordi(g‘) = r^lji) ifiGT. 

Proof. This lemma is a special case of Theorem 5.7 of [27]. □ 

We note that the existence of such functions extends to nilpotent groups 
admitting exponents in a binomial principal ideal domain, as described in The¬ 
orem 5.7 of ^7\ . 

Computation of the functions Pi,... ,Pm, Qi, ■ ■ ■ ,Qm, in the case when all 
factors Gi/Gij^i are infinite, can be done via the “Deep Thought” algorithm m- 
If some factors are finite, one may introduce the functions r^ and L one factor 
at a time using a procedure similar to that used to compute normal forms given 
in §7.2 of [17]. It is worth observing that in this construction the values of the 
numbers Ptp, pLu, and are not essential: one may compute functions in 
which all of these values appear as variables. 
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For ^and 21 the remainder of this section, we fix G to be a finitely pre¬ 
sented nilpotent group. Set c to be the nilpotency class of G and fix a Mal’cev 
basis A = {oi,..., am} associated with the lower central series of G, with m the 
size of this basis. Algorithmic results in these sections do not take G as part 
of the input and so we may, in light of Proposition 12.11 and Lemma [221 assume 
that the presentation of G is precisely the consistent nilpotent presentation cor¬ 
responding to A and that the functions pi, qi are known (computed in advance). 
However, we state algorithmic results without restriction on the presentation of 
G with the understanding that such algorithms will, in pre-computation, trans¬ 
late to such a presentation if needed. In 21 we provide uniform algorithms, in 
which G is included in the input. 


2.2 Polynomial bound on the length of normal forms 

Suppose w = X 1 X 2 • • • is a word over A^. In order to compute the coordinate 
tuple of w, one may use the polynomials pi to compute the coordinates of X 1 X 2 , 
then use this result to find the coordinates of {xiX 2 )x 3 and so on. However, 
the resulting computation is an n-fold composition of the polynomials and thus 
we may a priori expect a bound of order /c” on the magnitude of the coordi¬ 
nates, with k being the maximum degree of the polynomials. This presents an 
obstacle to logspace computation, as the binary representation of integers that 
size requires linear space. We show that the coordinates are in fact of order n°, 
where c is the nilpotency class of G. The following result is, to our knowledge, 
folklore (our proof is adapted from unpublished lecture notes by C.Dru^u and 
M.Kapovich). 

Theorem 2.3. Let G be a nilpotent group with a lower central series G = 
Ti l> ... [>rc O Tc+i = {1} with an associated Mal’cev basis A = {an,..., acm„}- 
There is a constant k, depending only on the presentation of G, such that for 
every word w over A’^, 

ICoordy (w)| < K ■ [rcP (5) 

for all i = 1,... ,c, I < J < mi- 
Proof. We must show that if 


''ll 




7cm, 
’ (^crric 


1 < i < C, 1 < j < TTli, 


is the normal form of w, then | < k\wY. 

Note that since [ri,rj] < Ti+j, we have for each a = af/} and a' = a^/, 
^c, I<fc<mi, 1 < £ < rrij, a commutation relation of the form 


a,a = a. 


tl “t2 


■ • Oc 


( 6 ) 


where t > i j. Similarly, for each a = with ik G T, we have a relation 


Hi ^t 2 


UcTTlc 

^crtic 


(7) 


II 


where t > i. Put E = max{eifc|ifc G T} (or iJ = 1 if T =0). Let Co G Z be 
greater than the word length of the right hand sides of all equalities (ED, dZD. 
Note that Co only depends on the presentation of C. 

For any word v over and integer 1 < n < c, denote by |u|„ the number 
of occurrences of letters an, , aimi , ■ • ■, Oni, • ■ •, a-nm„ and their inverses in v. 
We also formally put |u|„ = 0 for n < 0 and \v\n = \v\c for n > c. 

Claim. For every 1 < fc < c there is a constant Ck+i that depends only on k 
and the presentation of G such that for every word w over the corresponding 
group element can be represented in the form 

O'!! ')kmu 

w =G a/i •••«*/••• -Wk+i, 

where 


(A) 1 < i < k, 1<J < rrii, 

(B) | 7 ij| < Cfc+i|ty|b and 

(C) 0 < 7 y < eij if ij G T, and 

(D) Wk+i G Ffe+i is a word in as^k+i)i, ■ ■ ■ Acm^ with \wk+i\k+t < Ck+i\w\^^^ 
for all fc + 1 < fc + £ < c. 


We prove this claim by induction on k. We allow fc = 0 as the base case, 
which holds with wi = w and Ci = 1. 

Suppose the claim holds for some k — 1 > 0. Denote = Wk and push an 
occurrence of in this word to the left, using the commutation relations (ED- 
This gives the expression 

( 0 ) ±1 ( 1 ) 

<+i =G flfei - Will- 

Notice that for a = and a' = the right-hand side i? of (ED satisfies 
\R\k+e = 0 for alH > £ and \R\k+e < Cq otherwise. Therefore swapping a'H 
with afj^ increases the word length, by at most Cq, only tor i = 1,... ,£. So 

\w^kli\k+e < \w^kli\k+e + Co\w^kliU- 


Notice that this inequality holds in fact for all £ G Z. 

We proceed in the same fashion to move left all occurrences of , followed 
by all occurrences of and so on. At step j -f 1 we represent =g 

^kij ■ "^k+P^ where {i^} is a non-decreasing sequence. We similarly get 

\Wk+i\k+t < \w^lli\k+t + Co\w^lll\i, ( 8 ) 

for all ^ G Ij. All letters are collected on the left in at most N < \wk\k < 
Ck\w\^ steps, which gives 


Wk =G 


• • a 


krrik 


W 


(AT) 

fe +1 


(9) 


12 


with w[^\ e Ffc+i. 

We immediately see by the induction hypothesis that | 7 fci| < \wk\k < Ck\w\^, 
which delivers (A) provided Ct+i is chosen to be at least Ck- We also find a 
bound on \k+t, for all fc + 1 < fc + £ < c, which will be used to prove (C), 
by applying ([H]) repeatedly. 


wi+\\k+e < + 




(7V-1)| 


< 


(k^+i \k+i + C'oki+i 1 ^) + C'o (ki+i 1 ^ + Co\wi^i \e-k) 




< 

< 


^k+1 


\k+e + • ■ ■ + 




= E 




C^\wi'lr'^\k+,-.k 


< 


< 


< Co/"^iV^Cfc|n; 


( 0 ) I ^1 

^/e +1 

k-\-£. — Lk 


c/k 

0 


E 


\'^k \k+£—Lk 




k+i—ik 


l<4 


k+e 


^ C'fc+i 

Before obtaining (C), we must first reduce certain exponents to obtain 
(B), then repeat the collection process described above. Consider the word 
al\^ ■ ■ ■ and, for each ki € T, rewrite using ([7|) 

^fci <5i-j / Si.j 

Hi =Hi -Kz ) =GHi 


where 0 < Ski < ^ki and Vi consists of copies of the right-hand side of ©. 
Note that 

l^zl < Co ■ |sj| < Co ■ |7fez| < CoCk ■ |u;|''. 

For ki ^ T, put Ski = Iki and = 1. Thus the resulting word 

- Su-t <5^771^. 

Wk 

has length at most 

N + rukCoCk ■ \w\'^ < Ck ■ \w\'^ + mkCoCk ■ \w\'^ = Ck ■ 


Repeating the collection procedure for the word Wfc, as above, we obtain that 


Wk =G aif 


Skrr. 

^km 


k 


Uk+l, 
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where, using (|S]) repeatedly as before, 


\Uk+l\k+i 


< 

^cjk 





< 

'-^0 

J2iGk\w\>^)^Wk\ 



‘<1 

< 

^cjk 

'-^0 

Y,ci\wfCk\w\>^ 




< 




for all fc + 1 < fc + ^ < c. Combining this with with ([H|) , we get 

(5fci (-^) 

=G 

Thus setting Wk+i = Uk+iw^^i and Ck+i = Ck+i TC'^+i delivers (C), complet¬ 
ing the inductive step and the proof of the claim. 

To prove the lemma it is only left to notice that the claim with k = c suffices, 
which gives k = Cc+i- CH 


Remark 2.4. Observe that the torsion part of the above argument (obtaining 
(B)) can be adjusted to allow group relations 


Eik /^i,fc + l /^i,fc + 2 


fJ-cm 

' CLcrric 


for a = a. 


±1 

ik 


in place of 0, by processing aki, ak 2 , ■ ■ •, akm^ in succession (in that order). 
Further, note that though we use the lower central series in Theorem 12.31 we 
only need the property 

[R,r,]<r,+,. ( 10 ) 

Therefore, any central series having this property will suffice, with c being re¬ 
placed by the length of the series. 


In light of the importance of Theorem 12.31 to our work, we will refer to 
any Mal’cev basis associated with the lower central series of G as a lower cen¬ 
tral Mal’cev basis. Our algorithmic results usually assume that a lower central 
Mal’cev basis of G is given, though by Remark l2.4l one may substitute a central 
series satisfying (fTOl) . If one instead uses a polycyclic basis derived from a cyclic 
central series (called simply a Mal’cev basis in the literature) a similar polyno¬ 
mial bound, albeit of a higher degree, takes place (recall that the functions pi 
and Qi are described in Lemma [2^ and m is the polycyclic length of G). 


Lemma 2.5. Let G be a finitely generated nilpotent group with Mal’cev basis 
A = {fli,... ,am\. There are a constants k' and 5 depending on pi, qi, and m 
such that for every word w over 

|Coordi(ri;)| < k' ■ |rc|'^ 


for all i = 1,... ,c. 
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Proof. Assume the lemma holds for all nilpotent groups having a cyclic series 
of length m — 1, in particular for G 2 = ( 02 , • ■ •, Om)- Write w in the form 

W = Wia]^W2ai'^ ■ ■ ■ WraJ''Wr+l, (11) 

where each Wj is a word (possibly empty) in letters ..., and r/j G 1i\{0} 
for all j. The proof then proceeds via a right-to-left collection process utilizing 
commutation relations to obtain a word of the form 

W = alw'^w '2 ■ • ■ w(,UIr.+ i, 

where 77 = Vr and ■ • ■ w'j.Wr+i is an element of G 2 . □ 

We use Theorem 12.31 over the above statement in subsequent arguments, so 
we leave the details of the proof to the reader. 

We would like to mention that our methods, generally speaking, do not 
extend to polycyclic groups. Indeed, while every polycyclic group has a set of 
polycyclic generators, below we show that no polynomial bound for coordinates 
similar to (O can be met unless the group is virtually nilpotent. 

Proposition 2.6. Let H he a polycyclic group with polycyclic generators A = 
{oi,... ,am}- Suppose there is a polynomial P(n) such that if w is a word over 
of length n then 

|Coordi(w)| < P{n) 

for all i = 1,2.,... ,m. Then iJ is virtually nilpotent. 

Proof. Without loss of generality, assume that P is monotone. Let Bn be the 
ball of radius n centered at 1 in the Cayley graph of H relative to A. Then for 
every w € Bn, |Coordi(i(;)| is bounded by P{n), so \Bn\ < {2P{n) + 1)™, i.e., H 
has polynomial growth. By a result of Gromov [S] , it follows that H is virtually 
nilpotent. □ 

2.3 Computation of normal forms and the word problem 

Since our algorithms will accept words of as input, but perform subsequent 
computations using Mal’cev coordinates, a necessary first step is to compute 
these coordinates. The following statement improves on a result in [0], which 
proved that UT„(Z) has normal forms computable in logspace. 

Theorem 2.7. Let G be a finitely generated nilpotent group with Mal’cev ba¬ 
sis A. There is an algorithm that, given a word w of length L over A’^, com¬ 
putes the Mal’cev normal form of w in space O(logL) and, simultaneously, time 
0{L log^ L). 

Proof. Denote w = X 1 X 2 ■ • ■ xl. We hold in memory an array 7 = ( 71 ,..., 7 m)j 
initialized to (0,..., 0), which at the end of the algorithm will hold the Mal’cev 
coordinates of w. First, we use the functions {pi}i to compute the coordinates 
of the product X 1 X 2 , storing the result in 7 . We then compute, again using the 
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{pi}i, the coordinates of the product {X 1 X 2 ) ■ X 3 , using the saved coordinates 
of xiX 2 - We continue in this way, performing m(L — 1) total evaluations of 
functions from and obtain the coordinates of w. 

Each subword xi ■ ■ ■ xj has length bounded by L, so by Theorem 12 . 31 each of 
its coordinates may be stored as an O (log L)-bit number and hence 7 may be 
stored in logarithmic space. Each evaluation of a function pi involves addition, 
multiplication, and division (needed to evaluate the functions and Sk) with 
0(logi)-bit inputs. The time complexity of these operations is (sub)quadratic 
in the number of bits, hence the overall time complexity is O(Llog^L). The 
standard ‘schoolbook’ arithmetic operations use space linear in the number of 
bits, hence space 0(log L) in this case. □ 

Remark 2.8. The time complexity is in fact 0{L ■ /(logL)), where f{k) is the 
complexity of multiplying two fc-bit numbers. Several bounds that are tighter 
than the bound obtained from ‘long multiplication’ are known. 

Theorem EH of course implies that the word problem in G is decidable in 
logspace. This was previously known via embedding into UT„(Z), since [IH] 
proved that linear groups have LSPACE word problem. 

Corollary 2.9. Every finitely generated nilpotent group has word problem de¬ 
cidable in LSPACE. 

Note that Theorem 12.71 does not give a solution to the search version of the 
word problem, that is, the problem of writing a trivial element re as a product 
of conjugates of relators. We give a polynomial-time solution to this problem in 
Theorem 14.21 

While the above ‘letter-by-letter’ application of the functions pi is efficient 
for the initial conversion of input words into coordinates, subsequent coordinate 
computations generally involve a product of a constant number of factors, the 
coordinates each of which are known, hence it is more efficient to apply the 
polynomials to the coordinates of the factors. 

Lemma 2.10. Fix a positive integer k. Given integer vectors vi,... ,Vk repre¬ 
senting the coordinates of group elements gi,... ,gk and integers ci,..., one 
may compute 

Coord(gii52" 

in time 0{L^) and space 0{L), where L is the maximum number of bits required 
to represent any of the Vi or any of the Ci, for i = 1,..., fc. 

Proof. Since k is fixed, this follows immediately from the fact that arithmetic 
operations are computable in quadratic time and linear space. □ 

2.4 Compressed-word problems and binary Mal’cev coor¬ 
dinates 

Theorem l2. 31 implies that there is a constant b, depending on G, such that every 
coordinate of a word of length L may be expressed as a 61og(L)-bit number. 
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Therefore every element g G G which may be, on the one hand, represented 
by a word of length L, may be more efficiently represented as an m-tuple of 
61og(L)-bit numbers. In this sense, the specification of a Mal’cev basis provides 
a natural compression scheme on G. 

In formulating algorithmic problems over G it is therefore natural to expect 
that input group elements are encoded in this scheme, similar to the expecta¬ 
tion that elements of Z are encoded as binary numbers rather than in ‘unary 
encoding’ as words. On the other hand, straight-line programs provide a general 
method to formulate algorithmic problems over groups with compressed input, 
and in the case of nilpotent groups the use of straight-line programs eliminates 
the need to specify a particular Mal’cev basis. 

The two schemes are in fact equivalent for nilpotent groups: a coordinate 
tuple {ai,... ,am) encoded using L bits is easily converted to a straight-line 
program of size 0{L) producing the normal form 0 ““^ and we show 

below that conversion is also efficient in the opposite direction. 

Theorem 2.11. Let G be a finitely generated nilpotent group with Mal’cev basis 
set A. There is an algorithm that, given a straight-line program A over A'^, 
computes in time 0{L^) the coordinate tuple Coord(eval(A)), where L = |A|. 
Each coordinate o/eval(A) in the output is expressed as an 0{L)-bit number. 

Proof. We may assume that A is a lower central Mal’cev basis. Let B be any 
non-terminal of A. Since |eval(i?)| < 2^, Theorem 12.31 gives the bound 

|Coordi(eval(i?))| < (12) 

for each i = l,...,m and so Coordi(eval(i3)) may be expressed as a 0{L)- 
bit number. For every i, we compute Coordi(eval(i3)) by induction. If the 
production for B has the form i? —x, we simply report 0 or ±1, as the case 
may be. 

Otherwise, we have B —> CD and we assume that Coord^ (eval(C)) and 
Coordj(eval(Zl)) have been computed for j = 1,... ,m, and each coordinate is 
an 0(L)-bit number. Then we compute Coordi(eval(i?)) by evaluating pi at 
Coord(eval(C)) and Coord(eval(D)). Since the inputs are 0(L)-bit numbers, 
evaluation of pi may be done in time (sub)quadratic in L. Repeating for each 
of the L non-terminals of A gives the 0{L^) bound. □ 

We can now approach compressed-word versions of various algorithmic prob¬ 
lems in G by converting straight-line programs to Mal’cev coordinates then 
applying algorithms which work with coordinates. The first of these is the com¬ 
pressed word problem. A polynomial-time solution to this problem was also 
observed in m, via reduction to UT„(Z), and a new result in [TB] shows, using 
the same reduction, that the compressed word problem in any non-trivial finitely 
generated torsion-free nilpotent group is complete for the logspace counting class 
C^L. 

Corollary 2.12. The compressed word problem in every finitely generated nilpo¬ 
tent group is decidable in (sub) cubic time. 
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Throughout this paper we describe several algorithms which take as input 
one or more words over a hnitely presented nilpotent group G. Each such 
algorithm also comes in two ‘compressed’ versions. 

In the ‘compressed-word’ version, all inputs and outputs are straight-line 
programs and the size L of the input is the sum of the sizes of all input programs. 
In the ‘binary Mal’cev coordinate’ version, G is provided with a fixed Mal’cev 
basis A and all input and output words are coordinate tuples (relative to A) 
with each entry written as a binary number. The size L of the input is the total 
number of bits in the input coordinates. 

In all cases, the compressed-word version works by first computing the 
Mal’cev coordinates of each input straight-line program using Theorem 12.111 
and then invoking the binary Mal’cev coordinate version. 

3 Matrix reduction, membership problem, and 
subgroup presentations 

Several algorithmic problems, including the membership problem, may be solved 
by constructing an integer matrix from coordinate tuples corresponding to the 
generating set of a subgroup and performing a process similar to Gaussian elim¬ 
ination over Z to reduce the matrix to a unique ‘standard form’. This approach 
was detailed in [33], but without a computational complexity analysis. We re¬ 
view this reduction process and analyze its complexity in ^3.11 apply it to solve 
the membership problem in ^3.21 and use it to compute presentations for sub¬ 
groups in iI3.31 It is also essential for computing kernels of homomorphisms and 
thereby solving the conjugacy problem in i[4| 

3.1 Matrix reduction 

Let hi,... ,hn be elements of G given in normal form by hi = , for 

i = 1,..., n, and let H = (hi,..., hn)- To the tuple (hi,..., h„) we associate 
the matrix of coordinates 



(13) 


and conversely, to any nxm integer matrix, we associate an n-tuple of elements 
of G, whose Mal’cev coordinates are given as the rows of the matrix, and the 
subgroup H generated by the tuple. For each i = 1,..., n where row i is non¬ 
zero, let TTi be the column of the hrst non-zero entry (‘pivot’) in row i. The 
sequence (hi,..., hn) is said to be in standard form if the matrix of coordinates 
A is in row-echelon form with no zero rows and its pivot columns are maximally 
reduced, i.e. if A satisfies the following properties: 

(i) all rows of A are non-zero (i.e. no hi is trivial). 
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(ii) TTi < 7 r 2 < ... < TTg (where s is the number of pivots), 

(hi) aiTT^ > 0 , for alH = 1 ,..., n, 

(iv) 0 < ak-Ki < cx-i-Ki, for all 1 < fc < i < s, 

(v) if TTi € T, then ai.^^ divides for i = 1 ,..., s. 

The sequence is called full if in addition 

(vi) H n (oi, Oi+i,..., Urn) is generated by {hj \ itj > i}, for all 1 < * < m. 

In (j^, note that {hj \ itj > i} consists of those elements having 0 in their 
first i — 1 coordinates. Let us remark here that (T^ holds for a given i if and 
only if the following two properties hold. 

(vi.i) For all 1 < fc < j < s with TTfc < i, hj'‘ and are elements of 

{hi\l>k). 

(vi.ii) For all 1 < fe < s with iik <i and TTfc S T, g {hi\l > k). 

Indeed, Lemma 12.2tlivD implies that , and have coordinates 

I through TTj being 0 so the forward implication is clear. Conversely, given an 
element of FT fl (oi, a^+i,..., am), written as a product of generators of H, one 
may hrst reduce exponents of hi using (vi.ii) (if tti € T), and then, observing 
that the exponent sum of hi must be 0 , eliminate all occurrences of hf^ in 
conjugate pairs using (vi.i). Repeating with h 2 , / 13 ,..., hfe where irk < i < T^k+i, 
we obtain a word in generators {hj \ iTj > i\. 

The importance of full sequences is described in the lemma below, which can 
be found in [33] Props. 9.5.2 and 9.5.3. 

Lemma 3.1. Let H < G. There is a unique full sequence U = (hi ,..., h^) that 
generates H. Further, 

H = {h^^ ■ ■ ■ h^‘ \ Pi Gil and 0 < Pi < if € T} 

and s < m. 

We define three operations on tuples {hi ,..., h„) of elements of G, and the 
corresponding operation on the associated matrix, with the goal of converting 
{hi ,..., hn) to the unique full-form sequence for iJ = {hi ,..., hP). 

(1) Swap hi with hj. This corresponds to swapping row i with row j. 

(2) Replace hi by hih’-j {i jtz j, I g 1). This corresponds to replacing row i by 
Coord(hih*). 

(3) Add or remove a trivial element from the tuple. This corresponds to adding 

or removing a row of zeros; or ([3]) a row of the form (0 ... 0 ai+i ... am), 
where i gT and ■ • ■ a“'". 
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Clearly, all three of these operations preserve H. By combining these operations, 
we may also 

(4) replace hi with and 

(5) append to the tuple an arbitrary product hi^ - ■ ■ hi^ of elements in the tuple. 

Using the row operations defined above, we show how to reduce any coordi¬ 
nate matrix to its unique full form, thus producing the unique full generating 
sequence for the corresponding subgroup H. While it is not difficult to see 
that such reduction is possible, the details of the procedure are essential for our 
complexity estimates. We make use of the following algorithmic fact regarding 
greatest common divisors. 

Lemma 3.2. There is an algorithm that, given integers ai,...,a„ as binary 
numbers, computes in time 0{L^) an expression 

XiOi -I- ... -b Xnttn =d = gcd(ai,..., a„) 

with \xi\ < i max{|ai|,..., |a„|}, where L is the total number of bits in the 
input. If n is fixed, the algorithm may be run in space 0{L). 

Proof. We compute the expression using the binary tree method described in 
[2^ Thm. 9. This computation proceeds in two phases. In the first or ‘bottom- 
top’ phase, we place the integers oi,... ,a„ as the leaves of a binary tree, and 
we compute GCDs of the pairs ( 01 , 02 ), ( 03 , 04 ), ..., (a„_i,a„), recording each 
GCD as its expression as a linear combination of oi and oi+i in the parent node. 
We then continue up the tree computing the GCDs of the pairs of parents in 
the same fashion, obtaining d at the root. This involves invoking the extended 
Euclidean algorithm (or a more efficient algorithm) at most n — 1 times, each 
time with inputs bounded by M = max{|oi|,..., |o„|}. Each invocation runs in 
time 0(log^ M), hence the entire phase runs in time 0{L^). 

In the second or ‘top-bottom’ phase, we compute the coefhcients Xi,... ,Xn 
(satisfying the given bound) from the top of the tree downward, using ‘small’ 
coefficients at each step. Each computation uses a fixed number of arithmetic 
operations, hence this phase also runs in time 0{L^). 

Eor the space complexity, simply observe that when n is fixed the tree has 
constant size and we use logspace arithmetic operations, which must run in time 
polynomial in log L. □ 

Let Aq be a matrix of coordinates, as in m above. We produce matrices 
Ai,..., As, with s the number of pivots in the full form of Aq, such that for 
every k = 1 ,..., s the first columns of Ak form a matrix satisfying (In|-(j^, 
the condition (T^ is satisfied for all i < TTk+i, and Ag is the full form of Aq. 
Here we formally denote 7rs_|_i = m -I-1. 

Set ttq = 0 and assume that has been constructed for some fc > 1. In 
the steps below we construct Ak. We let n and m denote the number of rows 
and columns, respectively, of Ak-i- At all times during the computation, hi 
denotes the group element corresponding to row i of Ak and denotes the 
(z,_))-entry of Ak, which is Goordj(/ii). These may change after every operation. 
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Step 1. Locate the column tt/j of the next pivot, which is the minimum integer 
TTfc-i < TTk < m such that ^ 0 for at least one k < i < n. If no such 
integer exists, then k — l = s and As is already constructed. Otherwise, set 
Ak to be a copy of Ak-i and denote tt = TTfc. Compute a linear expression 
of (i = gcd(afe^,...,a„^), 

d Ik^k-K “t“ * * * “t“ ^n^n-TT- 

The coefficients Zj,,..., must be chosen so that \li\ < M for all i, where 
M = max{|afc,n.|,..., |a„,r|}- Let /i„+i = and note that 

has coordinates of the form 

Coord(Zi„+i) = (0,..., 0, d,...) 

with d occurring in position tt. Perform operation ([S]) to append /in+i as 
row n + 1 of Ak ■ 

Step 2. For each i = k,... ,n, perform row operation m to replace row i by 

Coord(/ii • 

For each i = 1,... ,k — 1, use @ to replace row i by 

Coord(Zii • 

Using (ED, swap row k with row n + 1. At this point, properties (Inll-dr^ 
hold on the first k columns of Ak- 

Step 3. If tt € T, we additionally ensure condition (j^ as follows. Perform row 
operation ((Sj), with respect to tt, to append a trivial element hn +2 with 
Coord(/i„+ 2 ) = (0,..., 0, e^,...) to Afc. Let S = gcd(d, e.^-) and compute 
the linear expression S = ni(i + n 2 e,r, with |ni|,|n 2 | < max{d, Ctt}. Let 
hn +3 = and append this row to Ak, as row n + 3. Note that 

Coord(Zi„+ 3 ) = (0,..., 0, (5,...), with S in position tt. Replace row k by 
Coord(Zifc • and row n + 2 by Coord(Zin +2 • producing zeros 

in column tt in these rows. Swap row k with row n + 3. At this point, (ED, 
(ImT) . and (|^ hold (for the first TTk columns) but (HE) need not, since the 
pivot entry is now S instead of d. For each j = 1,... ,k — 1, replace row j 
by Coord(Zij • ensuring diE - 

Step 4. Identify the next pivot TTk+i, setting TTk+i = m + I if is the last 
pivot. We now ensure condition (ElD for i < TTk+i- Observe that Steps 1-3 
preserve {hj | tt j > i) for all i < TTk- Hence (ElD holds in Ak for i < TTk 
since it holds in Ak-i for the same range. Now consider i in the range 
TTk < i < T^k+i- It suffices to prove (vi.i) for all j > k and (vi.ii) for TTk 
only. 

To obtain (vi.i), we notice that h'j^^hjhk,hkhjh'i^^ € {hi | .^ > fc) if and 
only if [hj,h^^] G {hi \ £ > k). Further, note that the subgroup generated 
by the set 

Sj = {l,hj, [hj, hk ],..., [hj, hk, ■ ■ ■, hk]}. 
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where hk appears m — times in the last commutator, is closed under 
commutation with hk since if hk appears more than m — tt^ times then the 
commutator is trivial. An inductive argument shows that the subgroup 
(Sj) coincides with {h^’‘ | 0 < £ < m — Similar observations can 
be made for conjugation by Therefore, appending via operation m 

rows Coord(h^'“) for all 1 < \£\ < rn — iTk and all k < j < n + 3 delivers 
(vi.i) for all j > k. Note that (vi.i) remains true for i < tt^. 

To obtain (vi.ii), in the case nk G T, we add row Coord(h^‘'^°'‘’”'=). Note 
that this element commutes with hk and therefore (vi.i) is preserved. 


Step 5. Using ([3]), eliminate all zero rows. The matrix Ak is now constructed. 

In applying row operation ([21) or the magnitude of the largest entry 
in the matrix may increase. It is essential to observe that during the matrix 
reduction algorithm the growth of this value is bounded by a polynomial of fixed 
degree (depending on G). 

Lemma 3.3. Let gi,... ,gt G G and let R be the full form of the assoeiated 
matrix of coordinates. Then every entry aij of R is bounded by 

where L = \gi \ + ■ ■ ■ + \gt\ is the total length of the given elements, and K = 
771 ( 80 ^)"* and C are constants depending on G. 

Proof. Denote by Aq the txm matrix of coordinates associated with {gi,..., gt). 
Following the matrix reduction algorithm described above, we will bound the 
entries of Ak in terms of the entries of Ak-i and by induction obtain a bound 
of the entries of R = Ag in terms of the entries of Aq. 

For a given Ak-i, denote by n the number of rows of Ak-i and by N the 
magnitude of the largest entry, i.e.. 


N = max{|ay I |I<j<n., I<j< m}. 


Observe that for each 1 <i <n, the element hi corresponding to row i of Ak-i 
has length \hi\ = |a““ < mN. Now in Step 1 we append the row 

hn+i, which satisfies 

|/7„+l| = \h^^ • • ■ <1 = \lk\\hk\ + ■ • • + \ln\\K\ 

< N(jhk \ + • ■ ■ + \hn\) 

< mnN^. (14) 

Denote by ab the (I,j)-entry at the end of Step 2. Since this number is 

Coordj(/iih~_[:“"^‘^^), except for i = /c, we have, using Theorem l2.3l 


|ab| < ^ 

< nirnN + N ■ mnN'^Y < K(2mnN^) 
= K{2mY ■ 



C 


(15) 
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For i = k, the tighter bound (fHl) holds. 

Proceeding to Step 2, denote E = max{ei \ i gT}. The new rows /i „+2 and 
hn +3 satisfy 

|^n+2| < 2kE‘^ 

l^n+sl = \K"h^l2\ = \ni\\hk\ + \n2\\hn+2\ 

< [EN)(mnN‘^) + {EN){2kE‘^) 

< 2mnKE‘^^^. ( 16 ) 

Let a"- denote the (*,j)-entry of Ak-i at the end of Step 4, and recall that 
Step 4 only appends rows to the bottom of the matrix. For row n + 3 (row k 
before swapping) we have, for all j, 

|Q:"„+3)il < K\hkh~%^\‘^ < K(|/ife| +iV|/i„+3|)'= 

< K{mnN^ + 2mnKE'=+^N‘^y < K‘=+\3mn)‘^E‘=^+‘=N^‘=. ( 17 ) 


In row n + 2 we have 


l'^(n+ 2 )jl — ^ 


,—eTr/S 

ln+ 2 n „_|_3 


< K (|/l„_|_2| + E\hn+3\Y 


< k{2kE^ + 2mnKE^+'^N^f < k‘^+^ . (18) 


Finally, for rows 1 through fc — 1 notice that each of hi,, hk-i has length 
bounded by m times the bound (fT51) . hence 


\a”i \ < K 


u i,“Kv/^l 

hjK 


< K{\hj\ + |a',,.||hfc|)‘^ 

< K {m{ 2 mYnrf + «;(2m)“n°7V^'^ • 2 mnKE'^~^^N^Y 

< { 6 KmnENY‘=^+^^. 


(19) 


Note that at the conclusion of Step 3, bound m applies to rows 1 through 
fc — 1 , bound (1161) applies to the element hk (formerly hn+ 3 ) in row fc, and the 
maximum of CH) and (1181) applies to all rows after fc. 

h‘ 

In Step 4 we append all rows of the type h^'’ for 1 < |Z| < to — and 
k < j < n + 3. The entries of such a row are bounded by 


a'pgl < K < K[\hj\ + 2\l\\hk\Y 


< K {AranY E'^ +2c^4c _|_ 2mm ■ 2mnKE‘^'^^ 

< C 


( 20 ) 


where C = {AnmE)'^^. If S T, we also append the row , 

and so the entries in the final row r of the matrix satisfy 

< K{Em • 2 mnKE^^^. 

<C"-inN)^‘^\ ( 21 ) 
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Thus the magnitude of each entry of Ak is bounded by C ■ where n 

is the number of rows of Ak-\ and N bounds the magnitude of the entries of 

Ak-i- 

Next, notice that Steps 1-3 add three rows, and Step 4 adds less than 
2m(n -|- 3) rows. We may bound the number of rows added by 10m ■ n. Conse¬ 
quently, the number of rows of Ak is bounded by (lOm)^ ■ t. 

A simple inductive argument now shows that every entry of R is bounded 

by 

where Nq is the maximum of the absolute value of entries in Aq and C" is a 
constant depending on m, c, E, and k. Now 

Aq < niax{K|(7i|‘^} < 

i 

Moreover, t < L and s < m. Therefore the entries of R are bounded by 

^ _ j^c{4c^)^4-4c^m 

We simplify the exponent of L by using the bound K = m(8c^)™. 

□ 

Lemma 13.31 allows us to produce a logspace version of the matrix reduction 
algorithm. Note that the matrix reduction algorithm, as presented, may use 
more than logarithmic space since the number of rows t of the initial matrix Aq 
may be of the same order as the total input size. 

Theorem 3.4. Let G be a finitely generated nilpotent group with lower-central 
Mal’cev basis A. There is an algorithm that, given hi,... ,hn & G, computes the 
full form of the assoeiated matrix of coordinates (relative to A) and hence the 
unique full-form sequence {gi,... ,gs) generating (hi,...,/i„). The algorithm 
runs in spaee 0{\ogL), where L = 'Y(^=i \hi\, and in time 0{L\o^ L), and the 
total length of the elements gi,... ,gs is bounded by a polynomial function of L 
of degree m(8c^)™. 

Proof. Algorithm description. Since the coordinate matrix for hi,..., h„ cannot 
be stored in logarithmic space, we adopt a piecewise approach, appending one 
row at a time and reducing. 

Form the coordinate matrix Bq of the first m elements, hi,..., hm and com¬ 
pute its full form Bi. Append to Bi a row corresponding to the coordinates of 
hra+i and compute the full form B 2 of this matrix. Append hm- 1-2 to B 2 and 
continue in this way until there are no more rows to append. Since the subgroup 
generated by the rows is preserved under row operations, the last matrix thus 
obtained, Bn-rm is the full form of the matrix of coordinates of hi,..., h„. 

Space and time complexity. We first show that at every step the matrix we 
work with can be stored in logarithmic space. Since each intermediate matrix 
Bi is in full form, Lemma 13.11 ensures that Bi has at most m rows. Hence the 
number of rows appended the reduction of Bi-i to Bi is, as seen in the proof 
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of Lemma ESI bounded by lOm^. The size of the working matrix is therefore 
never more than lOm^ x m (constant with respect to the input). 

As for the size of the entries, Theorem 12.31 shows that each entry aij of the 
matrix Bq satishes 

\aij\ < K\hi\‘^ < kL'^. 

Each entry can be encoded using O(logL) bits and therefore Bq can be stored 
in logarithmic space. 

Since the matrix Bi, 1 < I < n — m, is precisely the (unique) full-form 
matrix for the sequence hi,..., hm+i, Lemma [3.3l ensures that each entry of Bi is 
bounded in magnitude by C-L^. The proof of Lemma EEl shows that the bound 
given there holds during all steps of the reduction algorithm, hence during the 
reduction of i?;_i to Bi no entry can be greater in magnitude than C{m?CL^)^. 
Consequently, all intermediate matrices can be stored in logarithmic space. 

It remains to show that the operations that we use can also be executed in 
logarithmic space and time 0(Llog^ L). Computing the linear expression of a 
CCD is preformed only on a bounded number of integers (the number being 
bounded by the number of rows, see Step 1 for the worst case), each encoded 
with 0(log L) bits. It follows that the procedure for doing so described in Lemma 
13.21 can be carried out in time O(log^L) and space O(logL). Computation of 
coordinates is performed initially for each hi using Theorem l2.7l Subsequent co¬ 
ordinate computations involve finding the coordinates of a product of a bounded 
number of factors raised to powers that are 0(log L)-bit integers (no larger than 
the greatest entry in the matrix). The coordinates of each factor are known, 
hence the computations are performed in time O(log^L) and space O(logL) 
by Lemma 12.101 The other operations (swapping rows, removing zero rows, 
locating pivot, etc.) are trivial. 

Finally, for each reduction phase (computing Bi from i3/_i), the number 
of the above operations (GCD, coordinates, etc.) is bounded. The number of 
phases is bounded by n < L, hence the time complexity is 0(Llog^ L). □ 

Remark 3.5. The factor of L in the time complextiy arises most heavily from 
the fact that the number n of input elements can, in general, only be bounded 
by L. If n is regarded as a fixed number, the most time-consuming computation 
is computing the Mal’cev coordinates and the overall time complexity is reduced 
to O(Llog^L). This remark also applies to Theorems 13.81 lOTl and 14.11 

Remark 3.6. The output of the algorithm uses binary numbers to encode the 
exponents which are the Mal’cev coordinates of gi. If one is interested in the 
expression of gi as words in generators of G, the time complexity grows according 
to the time it takes to print out the corresponding words. Given the polynomial 
bound on the total length of gi provided in the statement, the overall algorithm 
still runs in polynomial time. This remark also applies to all the search problems 
considered below, that is to Theorems 13.8113.11114.1114.2114.41 and 14.61 

In order to solve the compressed-word version of the membership (and other) 
problems, we require a compressed version of matrix reduction that runs in 
polynomial time. 


25 






Lemma 3.7. The compressed version of Theorem [M in which the input el¬ 
ements hi,... ,hn are given either as straight-line programs or binary Mal’cev 
coordinate tuples, runs in time OinL^) where L is the total input size. Each gi 
is output as a straight-line program or binary Mal’cev coordinate tuple, of size 
polynomial in L. 

Proof. For the compressed-word version, let Ai,..., A„ be the input straight- 
line programs encoding hi,...,hn. We first compute, by Theorem 12.111 the 
coordinate vectors Coord(eval(Ai)) for i = 1,... ,n. This operation occurs in 
time 0{nL^). Since |eval(Ai)| < 2^, we obtain from Theorem 12.31 that each 
entry of Bq is bounded by k ■ (f2.^Y hence is encoded using 0{L), rather 
than O(logL), bits. Therefore the reduction process described in Theorem 13.41 
runs in time 0{nLY. Note that one may instead include all n rows in the initial 
matrix, as in Steps 1-5, instead of the piecewise approach of Theorem l3.41 since 
logspace is not an issue here. 

Since |eval(Ai)| < 2^, it follows from Lemmathat each coordinate in the 
full-form sequence for H is bounded in magnitude by a polynomial function of 
2^. Then each element of the sequence may be expressed as a compressed word 
of size polynomial in L. 

The binary Mal’cev coordinate version simply omits the initial computation 
of coordinates. □ 

3.2 Membership problem 

We can now apply the matrix reduction algorithm to solve the membership 
problem in logspace, and its compressed-word version in quartic time. We also 
solve the membership search problem using only logarithmic space. 

Theorem 3.8. Let G be a finitely generated nilpotent group. There is an algo¬ 
rithm that, given elements hi,..., G G and h G G, decides whether or not h 
is an element of the subgroup H = (hi,..., hn). The algorithm runs in space 
logarithmic in L = |h| -|- 'YZ'i=i 1^*1 time O(Llog^L). Further, if h G H 
the algorithm returns the unique expression h = g^ ■ ■ ■ gf" where {gi,..., gs) is 
the full-form sequence for H (relative to a lower central Mal’cev basis), and the 
length of gj^ ■ • • is bounded by a degree 2771 ( 80 ^)"* polynomial function of L. 

Proof. Algorithm. Compute the full form B of the coordinate matrix corre¬ 
sponding to P[ and the full-form sequence ( 51 ,..., gg). As before, denote by Oy 
the (i, j)-entry of B and by tti, ..., tTs its pivots. 

By Lemma 13.11 any element of ih can be written as gf^ ■ ■ ■ gf^. We show 
how to find these exponents. Denote h^^^ = h and Coord(h*^^^) = (/3p\ ..., 
with h^-l) being defined below. For j = 1,..., s, do the following. If /Jp-* Y 0 
for any 1 < I < nj, then h ^ H. Otherwise, check whether divides . If 
not, then h ^ H. If yes, let 

Ij = and 
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If j < s, continue to j + 1. li j = s, then h = gj^ ■ ■ ■ g]‘ € H if = 1 and 

h ^ H otherwise. 

Complexity and length bound. We first prove, by induction on s, that the 
length of the output gj^ ■ ■ ■ g]‘ is bounded by a degree 


6{s) = o'* + to(8c^) 


,2\m 


E' 

Vi=0 


polynomial function of L. First observe that each gi has length bounded by a 
degree m(8c^)™ polynomial function of L by Lemma 15751 For s = 1 we have by 
Theorem 12.31 


l7i| < < kL^ 


hence h = gj^ has length bounded by a degree c + m(8c^)™ polynomial function 
of L. 

Now assume that gj^ ■ ■ ■ g]Ci has length bounded by a degree (5(s — 1) poly¬ 
nomial function of L. Then • • ■ gi^^\ -b \h\ is also bounded by 

a degree (5(s — 1) polynomial function, so by Theorem 12.31 and therefore 

1 7 sI, is bounded by a polynomial function of degree c ■ 5(s — 1). It follows that 
I is bounded by a degree 


c • (5(s — I) -b m(8c^)’" = S(s) 


polynomial function of L, and hence gj^ ■ g]‘' obeys the same degree bound. The 
bound stated in the theorem may be obtained using s < m. 

Regarding complexity, the initial computation of {gi,... ,gs) is performed 
within the bounds by Lemma |3.41 Since the magnitude of each 7 j is bounded, 
for all j, by a polynomial function of L, the coordinates may be computed 
and stored using logarithmic space and in time 0(log^ L) by Lemma 12.101 The 
complexity bound then follows from the fact that the recursion has constant 
depth s. 

□ 


Though the expression of h in terms of the full-form generators gi,..., gs 
of H provides a standardized representation, one may also wish to express h in 
terms of the given generators. 

Corollary 3.9. Let G be a finitely generated nilpotent group. There is an 
algorithm that, given elements hi,... ,hn € G and h £ {hi,..., h„), computes 
an expression h = where ij £ {1, ... ,n} and Cj = ±1. The algorithm 

runs in time polynomial in L = |h|-bX]i=i o.nd the output has length bounded 
by a polynomial function of L. 

Proof. We modify the matrix reduction algorithm from Section 13.11 so as to be 
able to express each gi as a product oi hi,... ,hn. To this end, along with the 
matrix Ak, we store at every step an array Ck which contains the elements corre¬ 
sponding to the rows of the matrix Ak, each written as a product of hi,..., h„. 
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Thus, Cs will be an array containing in each entry, z, a product of hi,... ,hn 
which is equal to gi. 

To obtain the array Ck from the information in step k — 1, we perform on 
Ck-i the corresponding row operation that was performed on Ak-i, except we 
record the result not in terms of Mal’cev coordinates, but directly in terms of 
the words in the array Ck-i- 

For 1 < A: < s, denote by Lk the length (as a word over the alphabet 
{hf^, ..., h^^}) of the largest entry of Ck- Note that at each step, every row 
has at most two operations performed on it that may increase its length (one 
application of (I2|) in Step 2 and one in Step 3), or is newly-created. Using the 
fact that all exponents involved are bounded in magnitude by C ■ , where 

C and K are the constants from Lemma 13.31 it easily follows that there is a 
poylnomial function f{L) such that 

Lk < f{L)Lk-i. 

Since Lq = 1 and fc < s, it follows that Ls is bounded by a polynomial 
function of L and therefore the computation of Cs is perfomed in polynomial 
time. 

Finally, we can simply substitute the expression of gi (for 1 < z < s) in 
terms of hi,..., into the expression h = gj^ ■ ■ ■ g]^ to obtain an expression 
h = with zi,..., zt G {!,..., n}. Since the length (over the alphabet 

..., of the expression gj^ ■ ■ ■ is bounded by a polynomial function 
of L, producing the expression h = occurs in polynomial time. 

Observe that this method will not yield a logspace algorithm, since the array 
Cs is too large to be stored in memory. □ 

The algorithm used in Theorem 13.81 may be combined with the compressed 
version of matrix reduction (Lemma I3.7|l to give a polynomial-time solution to 
the compressed membership problem. 

Theorem 3.10. Let G be a finitely generated nilpotent group. There is an 
algorithm that, given compressed words Ai,..., A„,]B (or binary Mal’cev coor¬ 
dinate tuples) over G, decides whether or not eval(B) belongs to the subgroup 
generated by eval(Ai),..., eval(A„). The algorithm runs in time 0{nL^), where 
L = |B| -|- |Ai| |A„|. 

As in Theorem 13.81 the algorithm may also compute the unique expression 
of eval(B) in terms of the standard-form seqnence for (eval(Ai),..., eval(A„)). 

3.3 Subgroup presentations 

We now apply matrix reduction to show that finitely generated nilpotent groups 
are logspace effectively coherent: given a finitely generated subgroup H, we can 
in logspace and quasilinear time compute a consistent nilpotent presentation for 
H. By the size of a presentation we mean the number of generators plus the 
sum of the lengths of the relators. 
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Theorem 3.11. Let G he a finitely generated nilpotent group. There is an algo¬ 
rithm that, given hi,... ,hn S G, computes a consistent nilpotent presentation 
for the subgroup H = {hi,..., hn). The algorithm runs in space logarithmic in 
L = X]r=i l5®l time O(Llog^L), the size of the presentation is bounded by 
a degree 2m(8c^)™ polynomial function of L, and binary numbers are used to 
encode exponents appearing in relators of the presentation. 

Proof. Algorithm. Begin by computing the full sequence {gi,..., gs) for H using 
Lemma 13.41 Let Hi = {gi,gi.\.i,... ,gs). We claim that 

H = Hi>H 2...> H,+i = 1 

is a cyclic central series for H. From property (T^ we have 
Hi = H n (UTTj ; ^TT^ + l; • ■ • ; ^m) • 

Since ... ,am) is a normal subgroup of G, it follows that the above is a 
normal series, and since Hi/Hi+i = {giHi+i) the series is cyclic. For i < j, 
[gi,gj] € (a,n.^+i,... ,am) n H < Hj+i hence the series is central. We conclude 
that {gi,..., gs) is a Mal’cev basis for H, so it suffices to compute the relators 
@-(13]) in order to give a consistent nilpotent presentation of H. The order e' 
of Pi modulo iJi+i is simply ei/Coord,ri (ffi)- We establish each relation (@ by 
invoking Theorem 13.81 with input g'^' and iLi+i = {gi+i,... ,gs). Since g*^' G 
iJi+i and (gi+i,... ,gs) is the unique full sequence for iLi+i, the membership 
algorithm returns the expression on the right side of (@. Relations ([3]) and o 
are established similarly. 

Complexity and size of presentation. Computation of the full sequence and 
invoking Theorem l3. 81 occurs within the stated space and time bounds. Regard¬ 
ing the size of the presentation, the number of generators is s < m, the number 
of relations is bounded by s -I- 2 ( 2 ), and each relation has length obeying the 
degree 2m(8c^)"* bound of Theorem 13.81 □ 

Remark 3.12. By Remark 12.41 the Mal’cev basis for H obtained in Theorem 
I3.11l mav be used in Theorem l2.3l Indeed, one may group the Mal’cev generators 
gi,... ,gs into the appropriate terms of the lower central series of G, i.e. set 

Hi = (<7feij Ski+i, ■ • ■ j5s) 

where ki is the least index such that £ F^ \ Fi-i. Then H = H[ > H '2 > 
■ ■■> H’^j^i = 1 is a central series for H in which [Hf iLj] < Rj'+j and gi,... ,gs 
is an associated Mal’cev basis. 

The compressed-word version of Theorem l3.11l running in polynomial time, 
follows immediately. However, the relators of the presentation are provided as 
straight-line programs. This is unconventional, but we may convert to an ‘un¬ 
compressed’ presentation, of polynomial size, using the following construction. 

Consider any presentation R of a group H in which each relator R is the 
output of a straight-line program Ar. We apply the following construction to 
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V. Add the set of non-terminals A of all Ar to the generators of V. For each 
production rule A —>■ BC, add the relation A = BC to the relations of V] for 
each production A ^ x add the relation A = a;; for A —>■ e add the relation 
A = 1. Replace the original relator eval(A/{) by the root non-terminal of An 
and denote the resulting presentation V' . 

It is clear that the sequence of Tietze transformations which at each step 
removes the greatest non-terminal and its production rule converts V' back to 
V. Thus V and V' present isomorphic groups. As the outputs eval(A/j) are, 
in many cases, exponentially longer than the size of the programs A/j, this 
construction significantly reduces the size of the presentation. 

Theorem 3 . 13 . Let G he a finitely presented nilpotent group. There is an al¬ 
gorithm that, given a finite set Ai,...,A„ of straight-line programs over G 
(or binary Mal’cev coordinate tuples), computes a presentation for the sub¬ 
group (eval(Ai),..., eval(A„)). The algorithm runs in time 0{nL^), where 
L = 'Y)h=i o.iT'd the size of the presentation is bounded by a polynomial 
function of L. 

We can also use this construction to remove the binary numbers from the 
presentation found in Theorem 13.111 and at the same time produce a smaller 
presentation. However, the presentation will no longer fit the definition of a 
nilpotent presentation due to the presence of extra relators used for compressing 
the exponents. 

Corollary 3 . 14 . The algorithm of Theorem \3.11\ may he modified to produce 
a presentation of H (without binary numbers) of size logarithmic in L. The 
algorithm runs in logarithmic space and time O(Llog^L). 

Proof. The number of generators and relators is bounded by a constant, as 
observed above. Each power appearing in a relator is compressed using a 
straight-line program of size 0{logj3). Since each exponent (3 is bounded by a 
polynomial function of L, the total size of the straight-line programs, and thus 
of the presentation, is logarithmic in L. □ 

4 Homomorphisms and the conjugacy problem 

Using matrix reduction, |33j showed how to compute the kernel of a homomor¬ 
phism and compute a preimage for a given element under a homomorphism. 
We prove in H4.ll that these algorithms may be run in logarithmic space and 
compressed-word versions in polynomial time. In H4.2l we apply these algorithms 
to solve the conjugacy problem. 

4.1 Kernels 

For fixed nilpotent groups G and H, we may specify a homomorphism from 
a subgroup K < G to H via a generating set (gi,... ,g„) of K and a list of 
elements hi,... ,hn where 4>{gi) = hi, i = 1,... ,n. For such a homomorphism. 
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we consider the problem of finding a generating set for its kernel, and given 
h G (j){K) finding g £ G such that (j){g) = h. Both problems are solved using 
matrix reduction iir the group H x G. 

Theorem 4.1. Fix finitely generated nilpotent groups G and F[, with lower- 
central Mal’cev basis A and B, and let c be the sum of their nilpotency classes 
and TO = |A| + \B\. There is an algorithm that, given 

• a subgroup K = {gi,..., g^) < G, 

• a list of elements hi,... ,hn regarded as defining a homomorphism f : 
K ^ H via 4){gi) = hi, and 

• optionally, an element h £ H guaranteed to be in the image of 4>, 
computes 

(i) a generating set X for the kernel of 4>, and 
(a) an element g £ G such that 4){g) = h. 

The algorithm runs in space logarithmic in L = |/i| +X]r=i(l^*l + If?*!) time 
0{L\og^ L), X consists of at most m elements, and there is a degree to(8c^)’" 
polynomial function of L that bounds the length of each element of X and a 
degree 2to(8c^)"* polynomial function of L that bounds the length of g. 

Proof. Let Ci be the nilpotency class of G and C 2 that of H and consider the 
nilpotent group H x G. From the lower central series H = ro(-ff) > ri(iJ) > 
...>Tc^{H) = loiH and G = ToiG) >Ti{G) > ...> VcAG) = 1 of G, form 
ioT H X G the central series 

iL X G = Ao > Ai > ... > Ae = 1 (22) 

where A^ = ri{H)G if 0 < * < C 2 and A^ = ri_c 2 (G) if C 2 < i < c = C 2 + Ci. 
Notice that this series has the property that [Aj,Aj] < A^+j, since both 
lower central series have this property and the subgroups H and G commute 
in H X G. Letting B = {bi,... ,bm 2 ) and A = (oi,..., Omi), we see that 
{bi,, bm 2 ,cii,..., ami) is a Mal’cev basis associated with (|2^ for which The¬ 
orem 12.31 applies. 

Let Q = {hipi 11 < * < n) and P = {(j){k)k \ k £ K}. We claim that Q = P. 
First, observe that P is a subgroup of x G. Now, each generator hipi of Q 
is in P, therefore Q < P. Conversely, if (b{k)k £ P, then k = qA ■ ■ ■ P* hence 
(j){k)k = • • • {hi^Pit)^* £ Q. 

Let W = {viUi,... ,VsUs) be the sequence in full form for the subgroup Q, 
where Ui £ G and Vi £ H. Let 0 < r < s be the greatest integer such that ^ 
(with r = 0 if all Vi are 1). Set X = {ur+i ,..., Un) and Y = {vi,..., vf). We 
claim that X is the full-form sequence for the kernel of f and Y is the full-form 
sequence for the image. 

From the fact that W is in full form, it follows that both X and Y are in 
full form. Since Q = P, it follows that Vi = (j){ui) for i = 1,..., s. Hence X is 
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contained in the kernel and Y in the image. Now consider an arbitrary element 
(j){g)g of Q. There exist integers /3i,..., /3s such that 

If (j){g) is any element of the image, the above shows that (j){g) = ■ ■ •uf'’, 

hence Y generates the image. If g is any element of the kernel, then 1 = 
4’{g) = • ■ • I'r'’- Since the integers /3i,..., /3s are unique (taking 0 < /3i < Ci 

when Ci G T), we have /3i = • • • = /3r. = 0. Consequently g = • • • u^‘‘ = 

, hence X generates the kernel. 

Then to solve ((11) , it suffices to compute W using Lemma 13.41 and return X 
(or {1} if r = s). To solve (HU, use Theorem 13.81 to express h as h = • • ■v^'' 

and return g = Ui^ ■ ■ ■ u^''. The length and complexity bounds are as given in 
Lemma [3^ and Theorem 13.81 □ 

We can apply Theorem 14.11 to solve the search version of the word problem 
in logspace and quasilinear time. 

Theorem 4.2. Let G be a finitely presented nilpotent group. There is an al¬ 
gorithm that, given a word w in generators of G which represents the identity 
element, produces a word w' written as a product of conjugates of relators of G 
such that w' is freely eguivalent to w. The algorithm runs in space logarithmic 
in L = jwl and time 0{L\og^ L), and the length of w' is bounded by a degree 
2m(8c^)'" polynomial function of L. 

Proof. Let G = (X|i?). Denote by F{X) the free nilpotent group on X of 
class c, by N the normal closure of R in F{X), and by (j> the canonical epi- 
morphism F{X) —?► G. Using Theorem 14.11 compute the full-form sequence 
Y = {yi,... ,ys) for N = ’kerf. By exhaustive search, write each yi as a prod¬ 
uct of conjugates of elements of R. Note that this step is a precomputation. 

Since w represents the trivial element, w is in N, so we run the algorithm 
from Theorem 13.81 in order to express w in the form w = yf^ ■ ■ ■ y]^. Then 
substitute the expression of each pi as a product of conjugates of relators of G. 

The space complexity, time complexity, and length bound are as given in 
Theorem 13.81 noting that N is fixed and hence Remark 13.51 applies giving 
0{L log^ L) rather than 0{L log^ L). □ 

In order to solve the compressed conjugacy problem, we need a compressed 
version of Theorem 14.11 in which all input and output words are encoded as 
straight-line programs. We follow the same algorithmic steps, using Lemma 
liLTl to compute W in Mal’cev coordinate form and Theorem 13.101 to compute 
j3i,. . . , Pr. 

Theorem 4.3. The compressed version of Theorem EH in which all input 
words are given as straight-line programs or binary Mal’cev coordinate tuples 
and the output is given in the same form, runs in time 0(nL^). 
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4.2 Conjugacy problem and centralizers 

The decidability of the conjugacy problem in finitely generated nilpotent groups 
has been known since m and [7] proved that every polycyclic-by-finite group 
is conjugately separable: if two elements are not conjugate, there exists a finite 
quotient in which they are not conjugate. While this fact leads to an enumera- 
tive solution to the conjugacy problem, a much more practical approach, using 
matrix reduction and homomorphisms, was given in |33| . 

The computational complexity of this algorithm, however, was not ana¬ 
lyzed. We show here that it may be run using logarithmic space, and that 
the compressed-word version runs in polynomial time. A necessary step in the 
solution is the computation of centralizers, which is achieved by induction on 
the nilpotency class c of G. 

Theorem 4.4. Let G be a finitely generated nilpotent group. There is an algo¬ 
rithm that, given g G G, computes a generating set X for the centralizer of g in 
G. The algorithm runs in space logarithmic in L = |g| and time O(Llog^L), 
X contains at most m elements, and there is a degree (16m(c -I- ij2^3(c+i)m 
polynomial function of L that bounds the length of each element of X. 

Proof. Let G = ro >ri > ... > Lc+i = 1 be the lower central series of G (or 
another series satisfying cni)). We proceed by induction on c. If c = 1, then 
G is abelian and G{g) = G so we return an, ai 2 ,. ■., aimj. Assume that the 
theorem holds for finitely presented nilpotent groups of class c — 1, in particular 
for G/Tc. We use the series for G/T^. 

Compute a generating set K = {fciTc,..., km-ma^c\ for the centralizer of 
gTc in G/Tc. Let J = (fci,.. ., k,n-m„,aci, ac2, ■. ■, OcmJ, which is the preimage 
of {K) under the homomorphism G —)• G/Lc. Define / : J —J- G by 

f{u) = [g,u]. 

Since u G J, u commutes with g modulo Tc, hence [g, a] G Tc and so Im(/) C Lc. 
We claim that / is a homomorphism. Indeed, 

f{g,uiU2) = b,'Ui'U 2 ] = [g,U2][g,ui][[g,ui],U2], 

but G Lc hence [[(;,ui],M 2 ] G Tc+i = 1, and [g,ui\ and [ 5 , 1 x 2 ] commute, 

both being elements of the abelian group T^. 

The centralizer of g is precisely the kernel of / : J —> Lc, since if h commutes 
with 5 , then hTc G (K) so h G J. We compute a generating set using Theorem 

EH 

Complexity and length bound. We proceed to prove the length bound by in¬ 
duction on c. For c = 1, the algorithm returns a single symbol. In the inductive 
case, the algorithm is invoked with input gTc, which has size L. Each returned 
generator ki has length bounded by some number k which is a polynomial func¬ 
tion of L of degree Since for each i = I,..., m — me, [g, fci] has 

length at most 4 k, each input-output pair [ki, [g, ki]) or [aej, [g, aej]) has length 
at most 5 k hence the total input to the kernel algorithm of Theorem 14.11 has 
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length bounded by 5mK. The length of the Mal’cev basis of Tc is nic < m, and 
its nilpotency class is 1, so we use 2m and c +1 in place of m and c in the degree 
bound in Theorem HU Hence the length of each output generator has length 
bounded by a polynomial function of L of degree 

• 2m(8(c+ 1)2)2 '" ^ (16to(c + l)2)3('’+i)™, 


as required. 

The logarithmic space bound follows from the fact that in each recursive 
call, the number c of which is constant, the total size of the input is bounded by 
a polynomial function of L and the kernel algorithm runs in logarithmic space. 
The time complexity of 0{Llog^ L) arises entirely from the computation of the 
Mal’cev coordinates of gT,.- Indeed, the total bit-size of the coordinate tuple of 
g is logarithmic in L and each invocation of the kernel algorithm is made with 
at most m elements of bit-size logarithmic in a polynomial function of L, hence 
logarithmic in L, so each of the c recursive calls terminates in time 0(log^ L) 
by Theorem 14.31 

□ 

We also require a compressed version of Theorem 14.41 The algorithm is the 
same, with a time complexity of 0{L^) arising both from the initial computation 
of Mal’cev coordinates and the fact that Theorem 14.31 is invoked with at most 
m elements of bit-size 0{L). 

Theorem 4.5. Let G be a finitely generated nilpotent group. There is an al¬ 
gorithm that, given a straight-line program A over G (or a binary Mal’cev co¬ 
ordinate tuple), computes in time 0{L^) a generating set of the centralizer of 
eval(A) in G, where L = |A|. The generators are given as straight-line programs 
(or binary Mal’cev coordinate tuples) of size polynomial in L. 

We may now solve the conjugacy and conjugacy search problems in logspace, 
again by induction on the nilpotency class of G. 

Theorem 4.6. Let G be a finitely presented nilpotent group. There is an algo¬ 
rithm that, given g,h G G, either 

(i) produces u G G such that g = u~^hu, or 
(ii) determines that no such element u exists. 

The algorithm runs in space logarithmic in L = |g| -|- \h\ and time 0{Llog^ L), 
and the word length ofu is bounded by a degree 2”’“ (fimc^)"’' polynomial function 
of L. 

Proof. Algorithm. We proceed by induction on c. If c = 1, then G is abelian 
and g is conjugate to h if and only if g = h. If so, we return u = 1. 

Now assume c > 1, and that the theorem holds for any nilpotent group of 
class c — I, in particular for G/Tc- Apply the algorithm to gTc and hVc, using 
the series {Ti/FcD’^Q for G/Tc- If these elements are not conjugate, then g and 
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h are not conjugate and we return ‘No’. Otherwise, we obtain tiFc S G/Tc- 
such that gVc = h'^Tc- 

Let (f) : G ^ G/Tc be the canonical homomorphism, J = (j>~^{G{gTc)), and 
define /: J —)• Lc by f{x) = [g,x]. As in the proof of Theorem 14.41 the image 
of / is indeed in Tc and / is a homomorphism. We claim that g and h are 
conjugate if and only if g~^h'" e f{J)- Indeed, if there exists w G G such that 
g = h”'", then 

1 • Tc = g~^w~^h'’w ■ Tc = [g, w] ■ 

hence w £ J, so w~^ S J as well. Then g~^h^ = £ f{J), as required. 

The converse is immediate. So it suffices to express, if possible, g~^h'" as [g^w] 
with w £ J,\n which case the conjugator is it = vw~^. 

Compute a generating set {wiTc, ■ ■ ■ jWm-mcJ'c} for C{gTc) using Theo¬ 
rem Then J is generated by {ici,..., Wm-m,a^ o-ci, a,c 2 , ■ ■ ■, Ocmc}- Compute 
Coord(g“^h'') and use Theorem Id.lOl to determine if g~^h'’ £ f{J) and if so use 
Theorem 14.31 to find w £ G such that g~^h'" = f{w). Return u = vw~^. 

Complexity and length of u. We first prove the length bound by induction on 
c. For c = 1, we have it = 1. In the inductive case, each Wi has length bounded 
by a degree polynomial function of L and v has length bounded by a 

degree (16mc^)"* polynomial function of L. Take <5 = ( 16177 , 0 ^)^“™, which bounds 
both of these degrees. Then the input to Theorem l4.ll which consists of the pairs 
(wi, [g, iCi]) for 1 = 1 ,..., m — me, the pairs (ocj , [g, Ocj]) for j = 1 ,..., and 
the element g~^h'", also has length bounded by a degree 6 polynomial function 
of L. Hence w has length bounded by a polynomial function of degree 

(16mc3)3‘=™ . 2 m( 8 (c-k 1 )^)™ < (16m(c-k 1 )^)^^'=+^)™. 

Since this is greater than the degree bound for i;, the output u = vw~^ satisfies 
this degree bound. 

Logarithmic space complexity follows immediately from the fact that the 
conjugator length only grows by a polynomial function of L and the depth of the 
recursion is constant. The time complexity arises entirely from the computation 
of Mal’cev coordinates, in Theorem l4.4l and of g~^h^. Indeed, Theorems Id.lOl are 
each invoked with a constant number of inputs, each having bit-size O(logL), 
and therefore their time complexity is 0(log^ L). □ 

We solve the compressed version of the conjugacy problem in the same way, 
using the compressed version of the centralizer algorithm. 

Theorem 4.7. Let G he a finitely generated nilpotent group. There is an 
algorithm that, given two straight-line programs A and B over G (or binary 
Mal’cev coordinate tuples), determines in time 0{L^) whether or not eval(A) 
and eval(B) are conjugate in G, where L = |A| -|- |B|. If so, a straight-line 
program over G of size polynomial in L producing a conjugating element is re¬ 
turned. 

Our solution to the conjugacy probelm in nilpotent groups allows to take 
advantage of results in [4], where Bumagin showed that a group G hyperbolic 
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relative to finitely generated subgroups Pi,... ,Pk has polynomial-time solv¬ 
able conjugacy (search) problem whenever the parabolic subgroups Pi,... ,Pk 
have polynomial-time solvable conjgacy (search) problem. Together with The¬ 
orem [5^ this immediately gives the following result. 

Theorem 4.8. Let G be a finitely generated group hyperbolic relative to finitely 
generated nilpotent subgroups Pi,..., Pk- Then the word problem, the eonjugacy 
problem, and the conjugacy search problem in G can he solved in polynomial 
time. Moreover, the time complexity of the word and conjugacy problems is 
0{L^ log L), where L is the length of input. 

Proof. Note that [H Theorem 1.5] allows to obtain specific estimates on the 
time complexity of the above algorithms in terms of the complexity of word and 
conjugacy (search) problems in Pi,... ,Pfc. In a given nilpotent group, on an 
input of length L, the time complexity of word problem is 0{L) by [12j . and 
that of a conjugacy problem is O(ilog^i) by Theorem 14.61 By [U Theorem 
1.5], we obtain that the time complexity of conjugacy problem in G is bounded 
by 

T{L) = max{0(P log^ P), 0(P^ • L ■ logP)} = 0{L^ log L), 
where L is the length of the input. 

The corresponding estimate in the case of conjugacy search problem, by [U 
Theorem 5.10] is given by 


max{T(P),0(C's(P))}, 

where T{L) is as above, and Gs{L) is a bound on the time complexity of con¬ 
jugacy search problem in all parabolic subgroups Pi. In our case, that gives a 
polynomial of a higher degree, according to Remark 13.61 □ 

5 Presentation-uniform algorithms 

The algorithms presented in the previous sections do not include the nilpotent 
group G as part of the input. We now consider problems which do take G as 
part of the input. Let J\fc be the class of nilpotent groups of nilpotency class at 
most c. We consider groups in A/j, presented using at most a fixed number, r, of 
generators. First, we use Theorem l3.4l to find consistent nilpotent presentations 
for such groups. 

Proposition 5.1. Let c and r be fixed integers. There is an algorithm that, 
given a finite presentation with r generators of a group G in Nc, produces a 
consistent nilpotent presentation of G and an explicit isomorphism, in space 
logarithmic in the size L of the given presentation. Further, the size of the 
presentation is bounded by a polynomial function of L and if binary numbers 
are used in the output relators then the algorithm runs in time O(Llog^L). 

Proof. Let G be presented duS G = {X \ R). Let F = F{X) be the free nilpotent 
group of class c on generators X. As a precomputation, produce a consistent 
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nilpotent presentation (Y \ S) for F. One may do this in such a way that X CY 
and elements of F\X are iterated commutators (so-called ‘basic commutators’) 
of elements of X. Consider the natural surjection (p : F ^ G and let N = ker{(j)), 
which is the normal closure of R in F. Denoting R = {ri,..., r^}, N is generated 
by iterated commutators [... xi], 0 : 2 ],..., Xj], where i = 1 ,..., fc, j < c, and 
a:i,..., ccj & X VJX~^. The total length of these generators is linear in L since c 
and r are constant. We now produce this generating set and apply Theorem l3.4l 
in F with this set, producing the full-form sequence T for N. 

Now G ~ (Y I S' U T), and we claim that this is a consistent nilpotent 
presentation. Since {Y | S) is a nilpotent presentation and the elements of 
T add relators of the form m, the presentation is nilpotent. To prove that 
it is consistent, suppose some yt € Y has order modulo {yt+i,... ,ym) in 
(F I S U T). Since the order is infinite in F, there must be element of the form 
• ■ ■ 2 /m'" in N. But then, by Lemma [3.11 T must contain an element 

y‘i‘y'iPX^ ■ ■ ■ 2 /m" where a' divides ai. Hence ai cannot be smaller than a' and 
so the presentation is consistent. 

The space and time complexity, and the polynomial bound on the size of 
the presentation, follow immediately from Theorem 13.41 since c is fixed and m 
depends only on c and r. □ 

If G is restricted to the class Me and presented with a fixed number r of 
generators, the above result can be employed to produce presentation-uniform 
versions of our algorithms for problems (|I))- (|Vl|) (see list on p. a that run in 
logarithmic space and quasi-linear time. 

Theorem 5.2. Let H denote any of the problems w-mi- For all c, r G N, 
there is an algorithm that, given a finite presentation {X\R) with |X| < r of a 
group in Me and input of II as words over X, solves H in on that input in 

logarithmic space and quasi-linear time. In the case of m, the second group 
may he specified in the input by a presentation but must also be in Me and use 
r generators. 

Proof. Let L denote the total input size. By Proposition 15.11 we can produce a 
consistent nilpotent presentation for G, of size polynomial in L, in logarithmic 
space and time 0{L\og^ L). The number of generators m of this presentation 
depends only on c and r, as it is equal to the number of generators of the 
precomputed nilpotent presentation of the free nilpotent group F of class c. 

We compute, in advance, the functions pi and qi for any nilpotent presenta¬ 
tion on m generators with the exponents otiji, Piji, yu, appearing as variables 
(see the discussion following Lemma 12.2|) . Substituting the correct values for 
these exponents (obtained from the nilpotent presentation of G) we obtain the 
functions pi and qi for the specific group G. Note that the magnitude of all the 
numbers aiji, fdiji, fiu, et is bounded by a polynomial function of L, hence they 
may be encoded as 0 (log L)-bit numbers. 

We note that Theorem 12.31 is satisfied for the obtained presentation, by 
Remark l2.4l The constant k depends on the presentation of G, but may be seen 


37 


from the proof of Theorem 12.31 to be bounded by a polynomial function of L 
of degree depending on c and r. Hence the magnitude of each coordinate of a 
word w is bounded by a polynomial function of L of constant degree. One may 
substitute this bound in subsequent arguments in place of Atlwj* without affecting 
the logspace or quasi-linear time nature of the computations. In Lemma 13.31 the 
constants C and K will be greater, but remain bounded by a constant depending 
on c and r. 

Specifically, the algorithms of Theorems 12.7113.4113.8113.11114.1114.4114.61 as 
well as Corollaries 13.91 and 13.141 run in logspace and time O(Llog^L) when G 
(from class Me with r generators) is included in the input. The length bounds 
given in these results do not hold as stated, but do hold with polynomials of 
some higher (but still constant for fixed c and r) degree. 

□ 
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